Add `Hubert` to the `AutoFeatureExtractor` #13366

anton-l · 2021-09-01T10:12:18Z

Quick fix to allow Hubert models to auto-load Wav2Vec2FeatureExtractor.

Caught this while trying to load Hubert without an explicit feature extractor in pipeline("audio-classification")

src/transformers/models/hubert/__init__.py

patrickvonplaten · 2021-09-01T12:52:35Z

src/transformers/models/hubert/__init__.py



 _import_structure = {
+    ".wav2vec2.feature_extraction_wav2vec2": ["Wav2Vec2FeatureExtractor"],


IMO - this is ok. What do you think @sgugger ? In short we need to make HuBERT work with AutoFeatureExtractor and it uses the exact same feature extractor than Wav2Vec2. Either we import Wav2Vec2 here or we add a hack to how feature extractors are loaded in models/auto/modeling_auto_feature_extractor.py WDYT?

This is less hacky than what we did with MT5. Works for me.

sgugger

Not super happy with the result but this is the simplest we got with the current way the Auto API is implemented. We could think of a way to deal with those duplicates processors/tokenizers in the future, if we are more use cases like this one.

Thanks for the PR!

sgugger · 2021-09-01T13:05:34Z

src/transformers/models/hubert/__init__.py



 _import_structure = {
+    ".wav2vec2.feature_extraction_wav2vec2": ["Wav2Vec2FeatureExtractor"],


This is less hacky than what we did with MT5. Works for me.

Add Hubert to the auto feature extractor

a4c0a09

anton-l requested a review from patrickvonplaten September 1, 2021 10:12

Fix import structure

b61ee66

patrickvonplaten requested review from LysandreJik and sgugger September 1, 2021 12:50

anton-l commented Sep 1, 2021

View reviewed changes

src/transformers/models/hubert/__init__.py Show resolved Hide resolved

patrickvonplaten reviewed Sep 1, 2021

View reviewed changes

sgugger approved these changes Sep 1, 2021

View reviewed changes

anton-l merged commit 2406892 into huggingface:master Sep 1, 2021

anton-l deleted the fix-hubert-pipeline branch September 8, 2021 21:06

anton-l mentioned this pull request Oct 15, 2021

Add the SEW and SEW-D speech models #13962

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `Hubert` to the `AutoFeatureExtractor` #13366

Add `Hubert` to the `AutoFeatureExtractor` #13366

Uh oh!

anton-l commented Sep 1, 2021

Uh oh!

Uh oh!

patrickvonplaten Sep 1, 2021

Uh oh!

sgugger Sep 1, 2021

Uh oh!

sgugger left a comment

Uh oh!

sgugger Sep 1, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants



		_import_structure = {
		".wav2vec2.feature_extraction_wav2vec2": ["Wav2Vec2FeatureExtractor"],

Add Hubert to the AutoFeatureExtractor #13366

Add Hubert to the AutoFeatureExtractor #13366

Uh oh!

Conversation

anton-l commented Sep 1, 2021

Uh oh!

Uh oh!

patrickvonplaten Sep 1, 2021

Choose a reason for hiding this comment

Uh oh!

sgugger Sep 1, 2021

Choose a reason for hiding this comment

Uh oh!

sgugger left a comment

Choose a reason for hiding this comment

Uh oh!

sgugger Sep 1, 2021

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add `Hubert` to the `AutoFeatureExtractor` #13366

Add `Hubert` to the `AutoFeatureExtractor` #13366